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Abstract. In the present study we assessed the utility of H3-Genesequences for phylogenetic reconstruction of the Ile- 
terobranchia (Mollusca, Gastropoda). Therefore histone H3 data were collected for 49 species including most of the ma¬ 
jor groups. The sequence alignment provided a total of 246 sites of which 105 were variable and 96 parsimony informa¬ 
tive. Twenty-four (of 82) first base positions were variable as were 78 of the third base positions but only 3 of the se¬ 
cond base positions. 


H3 analyses showed a high codon usage bias. The consistency index was low (0,210) and a substitution saturation was 
observed in the 3 K ^ codon position. The alignment with the translation of the H3 DNA sequences to amino-acid sequences 
had no sites that were parsimony-informative within the Hcterobranchia. 


Phylogenetic trees were reconstructed using maximum parsimony, maximum likelihood and Bayesian methodologies. 
Nodilittorina unifasciata was used as outgroup. 


The resolution of the deeper nodes was limited in this molecular study. The data themselves were not sufficient to clar¬ 
ify phylogenetic relationships within Heterobranchia. Neither the monophyly of the Euthyneura nor a step-by-step evo¬ 
lution by the “basal" groups was supported. A conclusion about the monophyly of Opisthobranchia and Pulmonata could 
not be extracted from our data because we did not have any resolution at this point. 

We believe histone H3 alone provides no new marker for studying deep molecular evolution of the Heterobranchia due 
to the high grade of conservation and the low phylogenetic signal. 

Surprisingly there was a good resolution on the genera level. Analyses conducted with maximum parsimony and Bayesian 
inference (using all data) recovered all (or nearly all) genera mostly with statistically significantly supported nodes. Fur¬ 
ther studies focusing on the possible utility of histone H3 for the resolution of recent splits will be necessary. 

Keywords. Heterobranchia, Opisthobranchia, histone H3, molecular pliylogcny. 


1. INTRODUCTION 

Many questions regarding gastropod phylogeny have not 
yet been answered such as the molecular confirmation of 
the Heterobranehia eoneept based on morphological stud¬ 
ies from Haszprunar (1985. 1988). This taxon contains 
the Pentaganglionata Haszprunar, 1985 also known as 
Euthyneura Spengel, 1881 (with the Opisthobranchia and 
Pulmonata) and several mostly little known “basal” groups 
(e.g. Valvatoidea, Omalogyroidea, Arehitectonicoidea, 
Rissoelloidea and Pyramidelloidea) whieh present a step- 
by-step evolution towards the euthyneuran level of organ¬ 
isation (Haszprunar 1988). The hyperstrophy of the pro- 
toconeh is the most important autapomorphous character 
of the Heterobranchia. The Euthyneura are characterised 
by the presence of two additional (so-called parietal) gan¬ 
glia. However, the monophyly of the Euthyneura has not 
been clarified by moleeular studies, yet. In some studies 
they are recovered monophyletie (Colgan et al. 2000, 


2003; Knudsen ct al. 2006) in others not (Thollesson 
1999). The Pulmonata and Opisthobranehia can be sepa¬ 
rated by characters respective of the nervous system (pres¬ 
ence of a proeerebrum and ecrebral bodies in pulmonates 
and presence of a rhinophoral nerve in Opisthobranchia 
and Pyramidelloidea). However the moleeular confirma¬ 
tion regarding the monophyly of the Opisthobranehia 
(Vonnemann ct ah 2005; Grande et al. 2004a) and the 
Pulmonata (Tillier et al. 1996, Dayrat ct al. 2001) is still 
a matter of debate. There is no comprehensive investiga¬ 
tion concerning the “basal” groups. Only a few represen¬ 
tative taxa (e.g. Valvatoidea - Comirostra pellucida , Ar¬ 
ch iteetonieoidea - Philippea lutecu Pyramidelloidea 
Pyramided la dolabrata) have been included in current mo¬ 
leeular studies (Colgan et al. 2000; Grande et al. 2004a, 
2004b). 
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In recent years molecular systematic analyses in gas¬ 
tropods have utilised a variety of genetic markers, e.g. nu¬ 
clear 28S ribosomal RNA and/or 18S ribosomal RNA or 
mitochondrial 16S ribosomal RNA and/or cytochrome ox¬ 
idase subunit I (Tillier et al. 1994, 1996; Dayrat et al. 
2001; Vonnemann et al. 2005; Thollesson et al. 1999; 
Remigio & Hebert 2003). Nevertheless, new genetic 
markers are needed for the resolution of certain phyloge¬ 
netic relationships (especially regarding deeper nodes). 
Partial fragments of the gene coding for the extremely con¬ 
servative 113 protein (Maxson et al. 1983) were first used 
to clarify arthropod molecular evolution (Colgan et al. 
1998) and later polychaetc (Brown et al. 1999), gastro¬ 
pod (Colgan ct al. 2000,2003), polyplacophoran (Okusu 
ct al. 2003), cephalopod (Lindgren et al. 2004) and hexa¬ 
pod (Kjer et al. 2006) phylogeny. All studies used a com¬ 
bined dataset in their approaches. In their study of gas¬ 
tropod phylogeny, Colgan et al. (2000) did not find a 
monophyletic Heterobranehia while within the Euthyncu- 
ra, the Opisthobranchia arc paraphyletic with respect to 
the pulmonatcs. Very similar phylogenetic relationships 
were shown in Colgan et al. (2003). The Heterobranehia 
as well as the Opisthobranchia and Pulmonata arc rarely 
recovered as monophyletic in these studies. 


In the present study we wanted to test the utility of H3 
gene sequences for phylogenetic reconstruction within the 
Heterobranehia (focusing primarily on the Opistho¬ 
branchia). We were especially interested in testing 
whether H3 is suitable to resolve deeper nodes within het¬ 
erobranch phylogeny. Therefore, partial histone H3 data 
were collected for 49 species including most of the ma¬ 
jor groups (Euthyneura with Opisthobranchia and Pul¬ 
monata and “basaF groups with Valvatoidea, Architecton- 
icoidea, Omalogyroidea, Rissoelloidea and Pyramidel- 
loidea). 

2. MATERIALS AND METHODS 

2.1. Specimens and DNA extraction 

The studied taxa and the accession numbers are listed in 
Table 1. Twenty of the 49 sequences are taken from Gen- 
Bank. Opisthobranchia are represented by 26 species (in¬ 
cluding 11 suborders). Nodilittorina unifasciata 
(Caenogastropoda Cox, 1960) was used as an outgroup. 

DNA was extracted from ethanol-preserv ed individuals us¬ 
ing the DNeasy Tissue Kit from Qiagen (Hildcn, Ger¬ 
many). 


Table 1 . Taxonomic positions and collecting locations of the sampled taxa. Accession numbers of sequences included in the ana¬ 
lyses (ZSM = Zoologisehe Staatssammlung); published sequences taken from GenBank are marked with an asterisk. 


Major Taxon 

Species 

Locality 

GenBank Accession Number 

Caenogastropoda 
l ittorinoidea 
Eittorinidae 

Nodilittorina miifasciata (Gray, 1826) 

Genbank 

AF033705* 

Conidae 

Conns miles Linnaeus, 1758 

Genbank 

AF033684* 

Campanil idae 

Campanile symbolicmn Iredale, 1917 

Genbank 

AF033683* 

Opisthobranchia 

Nudibranchia 

Tethydidae 

Tcthvsfimbria Linne, 1767 

Blanes, Spain 

EF133468 

Discodorididae 

Discodoris atronwmlata (Bergh, 1880) 

Genbank 

DQ280013* 

Annin idae 

Annina neapolitana (Delle Chiaje, 1824) 

Banyuls-sur-Mer, France 

EF133469 

Pleurobranchoidea 

Pleurobranchidae 

Plenrohranchaca meckeli Leue, 1813 

Blanes, Spain 

EF133470 

Tylodinoidca 

Umbraculidae 

Uinbracnhnn uinhraculinn (Lightfoot, 1786) 

Atlantic Ocean, Meteor Bank 

EF133471 

Cephalaspidea 

Scaphandridae 

Scaphander lignarins (Linne, 1758) 

Blanes, Spain 

EF133472 

Philinidae 

Phi line aperta (Linnaeus, 1767) 

Genbank 

DQ093508* 

Gastropteridae 

Gastropteron meckeli Kosse, 1813 

Blanes, Spain 

EF133473 

Anaspidea 

Akeridae 

Akera hnllata Miiller, 1776 

Kattegat, Denmark 

EF133474 

Aplysiidae 

Aplysia calif arnica Cooper, 1863 

Miami, USA 

EF133475 

Aplysiidae 

Aplvsia cf. jidiana Quoy & Gaimard, 1832 

Genbank 

AF033675* 

Aplysiidae 

Bnrsatella leachii de Blainvilie, 1817 

Dingo Beach, Australia 

EF133476 

Thecosomata 

Cavoliniidae 

Clio pyramidata Linne, 1767 

Canary Islands; Spain 

EF133477 

Creseidae 

Creseis sp. 

Genbank 

DQ280012* 

Gyninosomata 

Pneumodermatidae 

Pnenmoderma cf. atlantica (Oken, 1815) 

USA, Atlantic 

EF133478 
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Major Taxon 

Species 

Locality 

Gen Bank Accession Number 

Sacoglossa 

Placobranchidae 

Elysia timida (Risso, 1818) 

Blanes, Spain 

EF133479 

Placobranchidae 

Elysia pitsilla (Bergh, 1872) 

Genbank 

DQ534792* 

Placobranchidae 

Elysia crispata Morch, 1863 

Genbank 

DQ534790* 

Placobranchidae 

Elysia viridis (Montagu, 1804) 

Genbank 

DQ534790* 

Cylindrobullidae 

Cylindrobulla beauii Fischer, 1857 

Florida, USA 

EF133480 

Acochlidia 

Hedylopsidae 

Hedvlopsis spiculijera (Kowalewsky, 1901) 

Rovinj, Croatia 

EF133481 

Microhedylidae 

Unela glandulifera (Kowalewsky, 1901) 

Rovinj, Croatia 

EF133482 

Architectibranchia 

Hydatinidae 

Micromelo undatus (Bmguiere, 1792) 

Genbank 

DQ093513* 

Acteonoidea 

Acteonidae 

Pupa solidula { Linne, 1758) 

Dingo Beach, Australia 

EF133483 

Acteonidae 

Rictaxis punctocaelatiis (Carpenter, 1864) 

Cayucos, Californica, USA 

EF133484 

Bullinidae 

Bullina lineata (Gray, 1825) 

Genbank 

AF033680* 

Pulmonata 

Systellommatophora 

Onchidiidae 

Onchidium sp . 

Genbank 

AF033706* 

Onchidiidae 

Onchidella floridana (Dali, 1885) 

Florida, USA 

EF133485 

Onchidiidae 

One hide lla sp. 

Genbank 

DQ093511 * 

Stylommatophora 

Charopidae 

Hedleyoconcha delta (Pfeiffer, 1857) 

Genbank 

AF033693* 

Siphonariidae 

Siphonaria serraia (Fischer, 1807) 

South Africa 

EF133486 

Siphonariidae 

Siphonaria conciwia Sowerby,1824 

South Africa 

EF133487 

Siphonariidae 

Siphonaria zelandica (Quoy & Gaimard, 1832) 

Genbank 

AF033713* 

Amphibolidae 

Salinator solida (Schacko, 1878) 

Genbank 

AF033712* 

Eupulmonata 

Ellobiidae 

Ophicardelus ornatus (Ferussac, 1821) 

Genbank 

AF033707* 

“basal'’ Heterobranehia/Triganglionata 

Pyramidelloidea 

Pyramidellidae Turbonilla lactea (Linne, 1758) 

Roscoff, France 

EF133488 

Pyramidellidae 

Turbonilla sp. 

Wellington, New Zealand 

EF133489 

Architectonicoidea 
Architectonic idae 

Heliacus variegatus (Gmelin, 1791) 

Tropical aquarium (ZSM 20012193) 

EF133490 

Architectonicidae 

Philippea lutea (Lamarck, 1822) 

Genbank 

AF033708* 

Valvatoidea 

Comirostridae 

Cornirostra pellucida (Laseron, 1954) 

Genbank 

AF033685* 

Orbitestellidae 

Orbitestella vera Powell, 1940 

Wellington, New Zealand 

EF561623 

Orbitestellidae 

Or bites tel la sp. 

Leigh, New Zealand 

EF561624 

Omalogyroidea 

Omalogyridae 

Omalogy ra burdwoodiana Strebel, 1908 

Antarctic (ZSM Mol-20021228) 

EF133491 

Rissoelloidea 

Rissoelidae 

Rissoella elongatospira Ponder, 1966 

Wellington, New Zealand 

EF561622 

Rissoelidae 

Rissoella micra Finlay, 1924 

Wellington, New Zealand 

EF561620 

Rissoelidae 

Rissoella cystophora Finlay, 1924 

Wellington, New Zealand 

EF561621 


2.2. DNA amplification and sequencing 

The following degenerated primers were used: H3-F: 5'- 
ATG GCT CGT ACC A AG CAG AC(ACG) GC-3’ and 
H3-R: 5’-ATATCC TT(AG) GGC AT(AG) AT(AG) GTG 
AC-3’ (Colgan et al. 1998) and produced a 246 bp prod¬ 
uct. The PCR profile was as follows: 95 °C for 5 min, fol¬ 
lowed by 35 cycles of 30 s at 95 °C, 25 s at 52 °C, 45 s 
at 72 °C and a final extension at 72 °C for 5 min and Taq 
Polymerase, recombinant from Invitrogcn (Karlsruhe, 
Germany) was used. All products were purified using the 
QIAquick Gel Extraction Kit from Qiagen (Hilden, Ger¬ 
many) and sequenced in both directions with a CEQ 2000 
Beckmann Coulter using the CEQ DTCS Quick Start Kit 


(Krcfeld, Germany). First we sequenced only one frag¬ 
ment per specimen. Moreover, to avoid mistakes accord¬ 
ing to the conservative character of the H3 gene, we se¬ 
quenced a random sample of 5 species a second time 
whereby no varieties could be detected. 

2.3. Sequence alignment 

Sequences were aligned manually using the software pack¬ 
age BioEdit version 7.0.5 (Hall 1999). The 113 DNA se¬ 
quences were translated into the amino acid sequences in 
GeneDoc version 2.6.002 (Nicholas & Nicholas 1997). 
The alignment is available from the authors upon request. 
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2.4. Statistical tests 

Codon usage statistics were calculated using GCUA ver¬ 
sion 1.2 (Me Inerney 1997). The purpose of this function 
is to calculate the Number (N) of times a particular codon 
is observed in an alignment and also to calculate the Rel¬ 
ative Synonymous Codon Usage (RSCU) values for the 
dataset. RSCU values define the number of times a par¬ 
ticular codon is observed relative to the number of times 
that the codon would be observed in the absence of any 
codon usage bias. Without any codon usage bias, the 
RSCU value would be 1.00. A codon that is used less fre¬ 
quently than expected will have a value of less than 1.00 
and a codon that is used more frequently than expected 
will have a volume of more than 1.00 (Me Inerney 1997). 

The degree of bias (Xx 2 / R ) which is the sum of the y} val¬ 
ues for the individual amino acids divided by the total 
number of inferred residues (n) for the combination of da¬ 
ta from all species (Shields et al. 1988) was determined. 

The substitution saturation was calculated for all 3 codon 
positions using the method developed by Xia et al. (2003) 
implemented in the software package DAM BE version 
4.2.13 (Xia & Xie 2001). 

2.5. Phylogenetic reconstruction 

Appropriate models for the analyses were selected after 
running Modcltest version 3.4 (Posada & Crandall 
1998) and using the Akaikc information criterion (AIC) 
(see tab. 2). 

The following analyses were conducted using PAUP* ver¬ 
sion 4.0 b 10 (Swofford, 2002) (settings: heuristic search 
strategy; tbr; gaps were treated as fifth bases): a) Maxi¬ 
mum parsimony for all data and b) Maximum likelihood 
for all data. 


Bootstrapping (Felsenstein 1985) was performed for 
maximum parsimony with 1000 replicates and for maxi¬ 
mum likelihood with 100 replicates. 

The following analyses were conducted using MrBayes 
version 3.1.2 (Ronquist & Huelsenbeck 2003): Bayesian 
inference: a) all data (with one model for all three codon 
positions), b) all data (with codon specific models) and 
e) only codon position one and two (third codon position 
excluded; with one model for codon position one and two). 

For Bayesian inference a Metropolis Chain Monte Carlo 
analysis with four chains and 1 000 000 generations was 
performed with the first 1000 trees ignored as bum-in. 

3. RESULTS 
3.1. Statistical tests 

The sequences provided a total of 246 sites of which 105 
were variable and 96 parsimony informative. Twenty-four 
(of 82) first base positions were variable as were 78 of the 
third base positions but only 3 of the second base posi¬ 
tions. Insertion/dclction events (indels) were not observed 
in any of the groups. The amino acid alignment had no 
sites that were parsimony-informative within the Hetero¬ 
branchia. 

H3 analyses showed a high codon usage bias (Tab. 3). The 
bias was principally against the use of A and U in the third 
codon position, x 2 tests were performed for all amino acids 
and revealed that the null hypothesis which is the expect¬ 
ed equal usage of the codons can be rejected for all amino 
acids with a significance level of 0,001 excepting histi¬ 
dine (p<0.05). For aspartic acid the null hypothesis can 
not be rejected (p 0.21). The degree of bias (2x 2 Av) 
showed a high value of 0,617. 


Tabic 2. Information on used models. 


Codon-Position 

1 st Position 
2 nd Position 
3 rd Position 
1 st and 2 nd Position 


Model 

Gamma distribution 

GTR+1 

«=equal 

TVMef+I 

a=equal 

TVM+G 

ct=0.8835 

GTR+I+G 

a--0.9167 

GTR+I+G 

a= 1.0265 


parameter Proportion of invariable sites 

Pinvar=0.5692 
Pin var=0.8647 
Pinvai-equal 
Pi nvai-0.6909 
Pinvai-0.5408 


l bt , 2 nd and 3 rd Position 
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Table 3. Codon Usage Bias. N = number of times a particular eodon is observed in a dataset (alignment). RSCU values num¬ 
ber of times a particular codon is observed, relative to the number of times that the eodon would be observed in the absence of any 
eodon usage bias. Amino acids (AA) are indicated by the three letter abbreviations. 


AA 

Codon 

N 

RSCU 

AA 

Codon 

N 

RSCU 

Phe 

uuu 

6 

(0.08) 

Ser 

ucu 

45 

(1.38) 

Phe 

uuc 

142 

(1.92) 

Ser 

ucc 

34 

(1.04) 

Leu 

UUA 

3 

(0.05) 

Ser 

UCA 

13 

(0.40) 

Leu 

UUG 

43 

(0.66) 

Ser 

UCG 

8 

(0.24) 

Tyr 

UAU 

2 

(0.04) 

Cys 

UGU 

0 

(0.00) 

Tyr 

UAC 

96 

(1.96) 

Cys 

UGC 

0 

(0.00) 

Ter 

UAA 

0 

(0.00) 

Tip 

UGG 

0 

(1.00) 

Ter 

UAG 

0 

(0.00) 

Pro 

ecu 

83 

(1.36) 

Ter 

UGA 

0 

(0.00) 

Pro 

ccc 

107 

(1.75) 

Leu 

CUU 

48 

(0.74) 

Pro 

CCA 

42 

(0.69) 

Leu 

cue 

81 

(1.24) 

Pro 

CCG 

13 

(0.21) 

Leu 

CUA 

2 

(0.03) 

Arg 

CGU 

233 

(2.59) 

Leu 

CUG 

214 

(3.28) 

Arg 

CGC 

127 

(1.41) 

His 

CAU 

31 

(1.27) 

Arg 

CGA 

18 

(0.20) 

His 

CAC 

18 

(0.73) 

Arg 

CGG 

9 

(0.10) 

Gin 

CAA 

41 

(0.28) 

TIu- 

ACU 

53 

(0.87) 

Gin 

CAG 

252 

(1.72) 

Tln' 

ACC 

146 

(2.38) 

lie 

AUU 

21 

(0.43) 

Thr 

ACA 

45 

(0.73) 

lie 

AUC 

126 

(2.57) 

Thr 

ACG 

1 

(0.02) 

He 

AUA 

0 

(0.00) 

Ser 

AGU 

8 

(0.24) 

Asn 

AAU 

0 

(0.00) 

Ser 

AGC 

88 

(2.69) 

Asn 

AAC 

0 

(0.00) 

Arg 

AGA 

74 

(0.82) 

Lys 

AAA 

135 

(0.61) 

Arg 

AGG 

78 

(0.87) 

Lys 

AAG 

306 

(1.39) 

Ala 

GCU 

201 

(1.49) 

Val 

GUU 

9 

(0.18) 

Ala 

GCC 

279 

(2.07) 

Val 

GUC 

89 

(1.82) 

Ala 

GCA 

47 

(0.35) 

Val 

GUA 

4 

(0.08) 

Ala 

GCG 

12 

(0.09) 

Val 

GUG 

94 

(1.92) 

Gly 

GGU 

32 

(0.87) 

Asp 

GAU 

38 

(0.78) 

Gly 

GGC 

38 

(1.03) 

Asp 

GAC 

60 

(1.22) 

Gly 

GGA 

72 

(1.96) 

Glu 

GAA 

69 

(0.70) 

Gly 

GGG 

5 

(0.14) 

Glu 

GAG 

128 

(1.30) 

Met 

AUG 

49 

(1.00) 


3.2. Phylogenetic analyses 


The consistency index in maximum parsimony analyses 
was low (0,210) as well as the retention index (0.444). Us¬ 
ing the method developed by Xia et al. (2003) a substi¬ 
tution saturation was observed in the third codon position 
(I ss 0,543 > I ssx 0,298). To support this observation the en¬ 
tropy for each position in the alignment was calculated us¬ 
ing BioEdit Version 7.0.5.2. Codon position one showed 
an average entropy value of 0.09 whereas the value for 
the second codon position was 0,009 and 0,69 for the third 
codon position. If the nucleotides occur more or less equal¬ 
ly at a certain position within the alignment the entropy 
is highest with the value of 1,36. 


The maximum parsimony 50 % majority-rule consensus 
tree (Fig. 1) showed all genera (Aplvsia, Elysia, Onchidel- 
la, Siphonaria, Orbitestellci , Turbonilla and Rissoella) re¬ 
covered as monophyletic. However, some of the bootstrap 
supports were low and there was no bootstrap support for 
a monophyletic Rissoella. Beyond the genera level, only 
Architectonicoidea were detected as monophyletic. All 
other nodes lacked support. 

The genera Orbitestella , Turbonilla , Onchidella and 
Aplvsia and again, the Architectonoicoidca were found to 
be monophyletic in the maximum likelihood analyses. The 
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Fig. 1 . 50 % majority-rule consensus tree of maximum parsimony of 14 most parsimonious trees based on histone H3 data set 
(nucleotides), number of parsimony informative characters = 96, consistency index (Cl) = 0,210. 
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Fig. 2. 50 % majority rule consensus Bayesian inference cladogram for the histone H3 dataset (based on nucleotides); Bayesian 
posterior probabilities provided at the branches. 
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Fig. 3. 50 % majority rule consensus Bayesian inference phylogram for the histone H3 dataset (based on nucleotides, 3 rd codon 
position excluded); Bayesian posterior probabilities provided at the branches. 
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basis of the 50 % majority-rule bootstrap tree (tree not 
shown) resembled a comb. There was no resolution of the 
deep nodes. 

The 50 % majority-rule consensus Bayesian inference 
cladogram (with one model for all three codon positions) 
(Fig. 2) also recovered all genera as monophyletic. Only 
the Bayesian posterior probability for Siphonaria was low 
(0.58) because only values above 0.95 are statistically sig¬ 
nificant. Beside the genera level Architectonicoidea were 
monophyletic and the Caenogastropoda formed a clade to¬ 
gether with OuiaJogyra. All other nodes had no statisti¬ 
cally significant support. 

The 50 % majority-rule consensus Bayesian inference 
cladogram (with codon specific models) (tree not shown) 
was quite similar to Figure. 2. All genera except Rissoel- 
la and Architectonicoidea were found to be monophyle¬ 
tic. The remaining nodes were supported by Bayesian pos¬ 
terior probabilities below 0.95. 

The 50 % majority rule consensus Bayesian inference phy- 
logram (with 3 rd codon position excluded and with one 
model for codon position one and two) (Fig. 3) recovered 
only the genera RissoeUa and Tnrbouilla as monophylet¬ 
ic while the Caenogastropoda together with OuiaJogyra 
were grouped separately from the rest of the taxa. All oth¬ 
er nodes had no statistically significant support. 

4. DISCUSSION 

Molecular investigations of deep-level relationships with¬ 
in the Gastropoda have been made difficult due to a lack 
of slowly evolving genes. Hence, a number of different 
markers have been utilized to solve this problem. Analy¬ 
ses of nuclear genes like the 28S ribosomal RNA and/or 
the 18S ribosomal RNA have provided a number of im¬ 
portant insights into gastropod relationships at several lev¬ 
els (Tillier et al. 1994, 1996; D ayrat et al. 2001; Von- 
nemann et al. 2005). The same applies to mitochondrial 
genes like the 16s ribosomal RNA (Thollesson et al. 
1999) or the cytochrome oxidase subunit 1 (Remigio & 
Hebert 2003). Colgan et al. (2000, 2003) used the his¬ 
tone H3 protein in combination with other genes to clar¬ 
ify gastropod molecular evolution. However, many aspects 
of gastropod phylogeny remain unclear such as the mo¬ 
lecular confirmation of the Heterobranchia concept based 
on morphological studies from Haszprunar (1985, 
1988). At the moment there is no comprehensive molec¬ 
ular study of heterobranch phylogeny especially one in¬ 
cluding the “basal” taxa (e. g. Pyramidelloidea, Architec¬ 
tonicoidea, Valvatoidea, Omalogyroidea and Rissoel- 
loidea). 


In this study we wanted to present a primary molecular 
insight into hctcrobranch phylogeny while simultaneous¬ 
ly testing the utility of the gene coding for the highly con¬ 
served protein histone H3 for resolving the deeper nodes 
within this taxon. 

Unfortunately, the present study did not provide a robust 
phylogenetic hypothesis for the relationships among dif¬ 
ferent lineages of Heterobranchia based on 113-Gencse- 
qucnces. Neither the monophyly of the Euthyncura nor a 
step-by-step evolution by the “basal” groups was support¬ 
ed. A conclusion about the monophyly of Opisthobranchia 
and Pulmonata could not be extracted from our data be¬ 
cause we did not have any resolution at this point. 

The first to investigate the value of histone H3 were Col¬ 
gan et al. (1998). They wanted to combine small nuclear 
ribonucleic acid U2 data and histone H3 to investigate 
arthropod molecular evolution. However, partitioned da¬ 
ta for H3 and U2 were incongruent according to Incon¬ 
gruence Length Difference tests. Using H3 data only, 
anomalous nodes appeared in their phylogenies while 
some possessed decay indices of 1. Therefore, their data 
were not sufficient to clarify relationships within major 
arthropod groups. 

Brown et al. (1999) investigated the DNA sequence da¬ 
ta of 34 Polychaeta species for partial histone 113, U2 snR- 
NA and two segments of 28S rDNA (D1 and D9-10 ex¬ 
pansion regions). When using 113 only. Brown et al. 
(1999) found a lack of concordance with morphological 
results and argued that the inclusion of all H3 data is in¬ 
appropriate for the phylogenetic levels under investiga¬ 
tions. 

Colgan et al. (2000) and later Colgan et al. (2003) used 
partial histone H3 (327bp) for the investigation of gastro¬ 
pod phylogeny. In Colgan et al. (2000), where the authors 
used 36 sequences of histone 113 only, using the chiton 
IschnocJiiton australis as an outgroup, no clades were re¬ 
tained in the bootstrap analyses. 113 with the third codon 
position excluded, retained only the higher Vetigastropo- 
da (bootstrap support = 68 %). 

In Colgan et al. (2003) in which H3 alone was used to 
recover phylogenetic relationships within Gastropoda, on¬ 
ly the clade of the Patel logastropoda with a support of 52 
% was recovered. When the third codon position was ex¬ 
cluded, none of the expected groups were recovered. 

Okuso et al. (2003) were the first to apply DNA sequence 
data to reconstruct the phylogeny of the molluscan class 
Polyplacophora. Their use of 59 sequences of histone 113 
resolved deeper nodes than the mitochondrial genes did 
while the strict consensus tree nested the two Gastropo- 
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da Viviparus georgicums and Siphonavia pectinata with¬ 
in Polyplacophora. 

A combined approach to the phylogeny of Cephalopoda 
(Mollusca) using 18S rRNA, 28S rRNA, histone 113 and 
COl underscored the aim of the study presented by Lind- 
gren et al. (2004). The strict consensus tree for the over¬ 
all optimal parameter set for 66 sequences of histone H3 
alone did not show monophyly for any classes investigat¬ 
ed. 

Kjbr et al. (2006) investigated the molecular phylogeny 
of Hexapoda (supermatrix approach with 137 taxa; 
375bp). They only recovered one ordinal level node when 
using only H3. 

According to our phylogenetic results and the results in 
the papers listed above histone H3 alone provides no new 
marker for studying deep molecular evolution of the Het¬ 
erobranchia due to the high grade of conservation and the 
low phylogenetic signal for deeper nodes. 

There were several indices defined by the results of our 
statistical tests supporting this assumption. We observed 
a high codon usage bias (sec tab. 2) in our alignment which 
was also indicated by an increasing frequency of C- and 
G-cnding codons and fewer A- and U-cnding codons 
(Shields et al. 1988). High C+G content at silent sites re¬ 
flects the effect of selection (Shiflds et al. 1988) while 
selective constraints against certain codons might reduce 
the amount of phylogenetic noise caused by synonymous 
substitution at cither first or third codon positions 
(Brown et al. 1999). However, our data suggest that a bias 
in eodon usage will not necessarily be indicative of the 
phylogenetic utility of a sequence. Despite a high eodon 
usage bias our computed phylogenetic trees showed a poor 
resolution of the deeper nodes. An explanation for this 
could be that the pressure to obtain the favoured codon 
had partially obscured the phylogenetic signal. Colgan 
et al. (1998,2000) made similar observations and conclud¬ 
ed that apparent, high codon-usage bias as found for the 
H3 data does not necessarily result in high phylogenetic 
consistency for DNA sequences. In the studies presented 
by Brown et al. (1999) a lack of agreement of the H3 
analyses with morphology occurs despite very high codon 
usage bias. They concluded that whilst selective con¬ 
straints may have reduced the absolute rate of synonymous 
substitutions, the pressure in favor of (homoplastic) resti¬ 
tution of the favoured eodon has at least a partially ob¬ 
scured phylogenetic signal. Hence, codon usage bias does 
not necessarily mean that a gene sequence will be phylo- 
genctieally useful. 

The degree of bias (^X 2/n ) showed a high value of 0,617. 
It was similar to the values observed in gastropods (0.60) 


(Colgan et al. 2000) and polychaetes (0.665) (Brown et 
al. 1999) and higher than the values of Drosophila 
melanogaster (Fitch & Strausbaugh 1993) and arthro¬ 
pods (0.37) (Colgan et al. 1998). 

Another indication suggesting the problems of 113 as a 
marker for studying deep molecular evolution was the high 
grade of conservation indicated by the lack of parsimo¬ 
ny-informative sites in the amino acid alignment. 

Additional important evidence was the observed substi¬ 
tution saturation at the third codon position. A saturation 
is caused by multiple-hits which render homoplasious 
changes. Homoplasy on the basis of saturation in substi¬ 
tution is one of the major problems in molecular phylo¬ 
genetics (Tillier et al. 1996). This problem generally be¬ 
comes more relevant at progressively higher taxonomic 
levels (Boore & Brown 1998). If numerous substitutions 
occur at the same position, a hiding or completely eras¬ 
ing of the ancient phylogenetic signal could be the result 
(Lopez et al. 1999). In order to avoid a decrease of the 
phylogenetic information contained in the sequences, we 
excluded the third codon position in further analyses. 
However, this was the position with the most variable sites 
(78 of 82 positions) in our data set. With the exclusion 
there was no phylogenetic information left for a resolu¬ 
tion of the deeper nodes. Trees which resemble combs at 
the base resulted (see fig. 3). The entropy was calculated 
for each position in the alignment to further assess the in¬ 
fluence of the 3 codon positions. A high entropy value im¬ 
plies that the nucleotides occur almost equally at this po¬ 
sition within the alignment. An almost equal distribution 
of the nucleotides at one position indicates a less selec¬ 
tive constraint allowing a higher frequency of substitu¬ 
tions. The average values for each of the three codon po¬ 
sitions indicate a less selective constraint for the third 
codon position hence supporting the previous result of this 
codon position being saturated. 

It is questionable if given a larger data set, the noise will 
eventually succumb to the signal. There are few examples 
where the expansion of an ambiguous data set has result¬ 
ed in a convincing phylogeny (Boore & Brown 1998). 

Surprisingly, there was a good resolution on genera lev¬ 
el. Analyses conducted with maximum parsimony and 
Bayesian inference (with all data) recovered all (or near¬ 
ly all) genera mostly with statistically significant support¬ 
ed nodes. However, our findings should be consider pre¬ 
liminary and further studies including more genera arc 
necessary to test the possible utility of histone H3 for the 
resolution of recent splits. In other studies (Colgan et al. 
1998; Onuso et al. 2003; Lindgren et al. 2004) some (but 
not all) genera were found to be monophyletic but due to 
the question the authors intended to answer, their studies 
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in regards to taxon sampling lacked genera represented by 
more than one species. 

In conclusion, still slowly evolving genes for the resolu¬ 
tion of the deeper nodes in gastropod phylogeny are miss¬ 
ing. To test the Heterobranehia concept as outlined by 
Haszprunar (1988) other markers have to be found with 
sufficient variability but no substitution saturation. 
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